Investigating the impact of data scaling on the k-nearest neighbor algorithm

نویسندگان

چکیده

This study investigates the impact of data scaling techniques on performance k-nearest neighbor (KNN) algorithm using ten different datasets from various domains. Three commonly used techniques, min-max normalization, Z-score, and decimal scaling, are evaluated based KNN algorithm's in terms accuracy, precision, recall, F1-score, runtime, memory usage. The aims to provide insights into applicability effectiveness contexts, aid design implementation machine learning systems, help identify strengths weaknesses each technique their suitability for specific types data. results show that significantly affects algorithm, choice method can have significant implications practical applications. Moreover, three varies across datasets, suggesting should be made characteristics Overall, this provides a comprehensive analysis practitioners researchers community make informed decisions when designing implementing systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the impact of morphological awareness on the vocabulary development of the iranian efl students

this study investigated the impact of explicit instruction of morphemic analysis and synthesis on the vocabulary development of the students. the participants were 90 junior high school students divided into two experimental groups and one control group. morphological awareness techniques (analysis/synthesis) and conventional techniques were used to teach vocabulary in the experimental groups a...

15 صفحه اول

Drought Monitoring and Prediction using K-Nearest Neighbor Algorithm

Drought is a climate phenomenon which might occur in any climate condition and all regions on the earth. Effective drought management depends on the application of appropriate drought indices. Drought indices are variables which are used to detect and characterize drought conditions. In this study, it was tried to predict drought occurrence, based on the standard precipitation index (SPI), usin...

متن کامل

the impact of skopos on syntactic features of the target text

the present study is an experimental case study which investigates the impacts, if any, of skopos on syntactic features of the target text. two test groups each consisting of 10 ma students translated a set of sentences selected from advertising texts in the operative and informative mode. the resulting target texts were then statistically analyzed in terms of the number of words, phrases, si...

15 صفحه اول

the impact of peer review on efl reviewers writing proficiency

امروزه تصحیح همکلاسی در کلاسهای نگارش یکی از اجزاء لاینفک کلاسهای دانش آموز محور است. تاثیرات مفید تصحیح همکلاسی بر زبان آموزان، معلمان را متقاعد کرده است که علیرغم صرف زمان، انرژی و توان بسیار، از این شیوه ی آموزشی در کلاسهای آموزش نگارش بهره بگیرند. تحقیق حاضر بر آن است تا با مقایسه دو گروه از یادگیرندگان زبان انگلیسی، تاثیر تصحیح همکلاسی را بر توانایی نوشتاری آنها نشان دهد. 122 خانم زبان آمو...

15 صفحه اول

k-Nearest Neighbor Classification on Spatial Data

Classification of spatial data streams is crucial, since the training dataset changes often. Building a new classifier each time can be very costly with most techniques. In this situation, k-nearest neighbor (KNN) classification is a very good choice, since no residual classifier needs to be built ahead of time. KNN is extremely simple to implement and lends itself to a wide variety of variatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Science and Information Technologies

سال: 2023

ISSN: ['2722-323X', '2722-3221']

DOI: https://doi.org/10.11591/csit.v4i2.p135-142